Dynamics of DNA translocation through an attractive nanopore 
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We investigate the dynamics of DNA translocation through a nanopore driven by an external 
force using Langevin dynamics simulations in two dimensions (2D) to study how the translocation 
dynamics depend on the details of the DNA sequences. We consider a coarse-grained model of DNA 
built from two bases A and C, having different base-pore interactions, e.g., a strong (weak) attractive 
force between the pore and the base A (C) inside the pore. From a series of studies on hetero-DNAs 
with repeat units AmC„, we find that the translocation time decreases exponentially as a function of 
the volume fraction fc of the base C. For longer A sequences with fc < 0.5, the translocation time 
strongly depends on the orientation of DNA, namely which base enters the pore first. Our studies 
clearly demonstrate that for a DNA of certain length with repeat units AmCn, the pattern 
exhibited by the waiting times of the individual bases and their periodicity can unambiguously 
determine the values of m, n and A'^ respectively. Therefore, a prospective experimental realization 
of this phenomenon may lead to fast and efficient sequence detection technic. 

PACS numbers: 87.15.A-, 87.15.H- 



I. INTRODUCTION 

Polymer translocation through a nanopore is a chal- 
lenging problem in polymer physics and it also plays 
a critical role in numerous biological processes, such as 
DNA and RNA translocation across nuclear pores, pro- 
tein transport through membrane channels, and virus in- 
jection. 

In a seminal experimental paper, Kasianowicz et al. [l| 
demonstrated that an electric field can drive single- 
stranded DNA and RNA molecules through the water- 
filled a-hemolysin channel and that the passage of each 
molecule is signaled by a blockade in the channel current. 
These observations can be directly used to characterize 
the polymer length. Triggered by their experiments and 
various potential technological applications P, 0, such 
as rapid DNA sequencing, gene therapy and controlled 
drug delivery, the polymer translocation has become a 
subject of intensive experimental H, 0, S, S S B S E, 

and theoret- 
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One of the fundamental questions that has been ad- 
dressed in the community is how the translocation time 
T depends on the system parameters and the polymer 
characteristics, such as: chain length TV i5i. iG. i21^. i22i. i24l . 
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A central issue from the point of view of sequencing is 
whether DNA translocation through a nanopore can be 
used to determine the detailed sequence structure of the 
molecule [isj . 

It has been demonstrated experimentally [1, [^ [l^ 
that DNA translocation dynamics is strongly influenced 
by nucleotide-pore interactions. Intuitively, for suffi- 
ciently strong attraction, the residence time of each 
monomer in the pore should increase, resulting in a 
much longer translocation time. In a recent Letter, we 
have investigated the influence of base-pore interaction 
on the translocation dynamics using LD simulations [131 . 
The results show that an attractive interaction increases 
translocation time slowly for weak attraction while an 
exponential increase is observed for strong attraction in 
the activated regime {s^. Under weak driving force and 
strong attractive force, the translocation time shows non- 
monotonic behavior. Our results are in good agreement 
with experimental findings [1, [1, [lB| . 

While a DNA is composed of four different nu- 
cleotides [6lj, to date most of the theoretical treatments 
have focused on scaling and universal aspects of translo- 
cation of a homogeneous polymers, although several ex- 
periments [3, [^ show that in the real biological sys- 
tems inhomogeneities in the structure and interactions 
between polymer and other molecules might have a sig- 
nificant effect on the overall dynamics. Previously, we 
have considered heteropolymcrs consisting of two types 
of monomers labeled A and C, which are distinguished by 
the magnitude of the driving force that they experience 
inside the pore [i^ . This model captured some essential 
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features of the heteropolymer translocation. However, for 
real biopolymers such as DNA and RNA, no charge differ- 
ence between the monomers (nucleotides) exists. Instead, 
the nucleotide-pore interactions are base specific [1, Q. 
Thus a more realistic model for studying the influence of 
structure on biopolymers translocation should differen- 
tiate bases A and C by different base-pore interactions. 
In a recent letter [i^, we have adopted such a model 
and investigated hetero-DNAs with symmetric blocks us- 
ing Langevin dynamics simulations. We found that the 
translocation time depends strongly on the block length 
as well as on the orientation of which base entering the 
pore first. 

In this paper we extend our results for the DNA com- 
posed of symmetric blocks, furnish more general charac- 
teristics of the residence time inside the pore, and provide 
more comprehensive results for the translocation dynam- 
ics of various sequences of a two component hetero-DNA. 
We also provide a simple interpretation for the sequence 
dependence of the monomer waiting time distribution 
and the total translocation time. The paper is organized 
as follows. In section II, we briefly describe our model 
and the simulation technique. In section III, we present 
our results. Finally, the conclusions and discussion are 
presented in section IV. 



II. MODEL AND METHOD 

In our numerical simulations, the polymer chains are 
modeled as bead-spring chains. Excluded volume inter- 
action between monomers is modeled by a short range 
repulsive LJ potential: ULj{r) = 4e[(^)^^ - {ff] + e for 
r < and for r > Here, a is the diameter 

of a monomer, and e is the depth of the potential. The 
connectivity between neighboring monomers is modeled 
as a Finite Extension Nonlinear Elastic (FENE) spring 
with UpENEir) = -^A:i?Qln(l - r'^/R^), where r is the 
distance between consecutive monomers, k is the spring 
constant and Rq is the maximum allowed separation be- 
tween connected monomers. 

We consider a 2D geometry as shown in Fig. [H where 
the walls along the y direction are formed by stationary 
particles within a distance a from each other. The pore 
of length L and width W are formed from two rows of 
stationary particles represented by black circles in Fig. [1] 
Between all monomer- wall particle pairs, there exists the 
same short range repulsive LJ interaction as described 
above. The base-pore interaction is modeled by a LJ po- 
tential with a cutoff of 2.5f7 and interaction strength SpA 
for the base A and Spc for the base C. This interaction 
can be either attractive or repulsive depending on the 
position of the monomer from the pore particles. 

In the Langevin dynamics simulation, each monomer is 
subjected to conservative, frictional, and random forces, 
respectively, with ^ m'r, = -~\7{ULj + UFENE)+Fc^t- 
^Vj-|-F^, where m is the monomer's mass, ^ is the friction 
coefficient, is the monomer's velocity, and Ff is the 



random force which satisfies the fluctuation-dissipation 
theorem. The external force is expressed as Fgxt = Fx, 
where F is the external force strength exerted on the 
monomers in the pore, and x is a unit vector in the di- 
rection along the pore axis. 

The LJ parameters e, a and the bead mass m fix the 
system energy, length and mass units respectively, lead- 
ing to the corresponding time scale Ilj = (mcr^/e)^/^ 
and force scale e/a. In our model, each bead corresponds 
to a Kuhn length of a single-stranded DNA containing 
approximately three nucleotide bases, so the value of 
(7 « 1.5 nm [63|. The average mass of a base in DNA 
is about 312 amu, so the bead mass m w 936 amu. We 
set ksT = 1.2e, so the interaction strength e corresponds 
to 3.39 X 10"2i J at a temperature 295 K. This leads 
to a time scale of 32.1 ps and a force scale of 2.3 pN. 
The remaining dimcnsionless parameters in the model 
are chosen to be i?o = 2, fc = 7, ^ = 0.7. In addition, 
the driving force for a bead in the pore is set as F = 0.5 
unless otherwise stated. This value corresponds to a volt- 
age of about 187.9 mV across the pore (assuming three 
unit charges on a bead and the effective charge 0.094e for 
a unit charge [91 [lO|y, within the range of experimental 
parameters [l|,l2|,U@, [!]■ The pore width W = This 
ensures that monomers A and C encounter an attractive 
force inside the pore [13] ■ The base-pore interactions 
EpA = 3.0 and Epc = 1-0 are chosen based on compari- 
son of the theoretical results [l^] with the experimental 
data f3| for the translocation time distribution histogram 
of poly(dC)ioo and poly(dA)ioo. It is worthwhile to note 
that the translocation time depends strongly on the pore 
length L. We have checked that L « 10 nm produces av- 
erage translocation times t «100 /is in accordance with 
the experimental data Q. For computational efficiency 
we present results for L = 5 (7.5 nm) here. The Langevin 
equation is integrated in time by a method described by 
Ermak and Buckholz [65] in 2D. 

To create the initial configuration, the first monomer 
of the chain is placed in the entrance of the pore. The 
polymer is then allowed to relax to obtain an equilibrium 
configuration. The translocation time is defined as the 
time interval between the entrance of the first bead into 
the pore and the exit of the last bead. The estimate for 
the translocation time was obtained by neglecting any 
failed translocation and then calculating the average du- 
ration of the successful translocation. Typically, we av- 
erage our data over 2000 independent runs with different 
initial conditions. 



III. RESULTS AND DISCUSSIONS 

A. Recognizing position-specific base A 
substitutions in poly(dC) 

As demonstrated by experiments P, a- nanopore 
might be used for DNA sequencing. The most fundamen- 
tal concern has been whether a-hemolysin or any other 
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FIG. 1: A schematic representation of the system. The pore 
length L = 5 and the pore width W = 3. 



natural or synthetic nanopore is capable of recognizing 
DNA with single base resolution. 

To address this question, we first investigate the 
translocation of poly(dC) with a single position-specific 
base A substitutions. Without any loss of generality, we 
present here only results for chain length N = 128, as 
many of the main conclusions here are not dependent on 
the chain length. Fig. 2 shows the waiting time dis- 
tribution for poly(dC3idAdC96). The waiting time of 
the monomer s is defined as the average time between 
the events that monomer s and monomer s + 1 exit the 
pore [13, 0, |4^. It yields more detailed information 
for the translocation process than the overall transloca- 
tion time. Compared with the waiting time distribution 
for Poly(dCi28), the waiting times for s = 26 — 36 are 
different, where one minimum (s = 28) and maximum 
(s = 32) are observed. This result can be easily under- 
stood through the following consideration. The pore of 
length L = 5 can accommodate Upore — 6 monomers on 
the average. When the monomer s = 32 — Upore = 26 
is at the end of the pore ready to exit to the trans side, 
the monomer corresponding to base A has just entered 
the pore through the cis side. The presence of the A 
monomer at this end of the pore reduces the probability 
of the backward motion of the polymer and hence the 
waiting time starts to decrease starting from s = 26 till 
it reaches a minimum at s = 28 when the A monomer is 
about half way inside the pore, after which the waiting 
time of subsequent monomers leaving the pore starts to 
increase until it reaches s = 32 which corresponds to the 
point where the monomer A is at the end of the pore 
ready to exit to the cis side. This maximum in wait- 
ing time results from the activated nature of the escape 
of the monomer A from the pore. After the monomer 
corresponding to base A has translocated, the next few 
monomers still have longer waiting times compared with 
those in a homopolymer Poly(dCi28). This is due to 
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FIG. 2: The waiting time distribution for poly(dC3idAdC96) 
Here £pA = 3.0, Spc ~ 1.0. 
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FIG. 3: The waiting time distribution for 

poly(dC3idA5dC92). Here SpA = 3.0, Spc = 1.0. 



the fact that the attraction between the pore and the 
monomer corresponding to base A induces more back- 
ward events. For other position-specific single base A 
substitutions, we observed similar results. 

For longer base A sequence substitutions such as 
poly(dC3idA5dC92), the results are shown in Fig. 3. 
They can be understood following the same reasoning 
presented above for the single A sequence. We still 
observe one minimum and one maximum between s = 
26 — 36. As expected, the beginning of the decrease 
in waiting time still starts at s = 26 corresponding to 
s = 32 — Upore = 26. The maximum now changes to to 
s — 31-1-5 when the last monomer of type A is at the 
end of the pore ready to exit to the trans side. For other 
sequences such as poly(dC3idA„idC97_,„), the start of 
the downturn at s = 26 and the minimum at s = 28 
remains the same while the maximum will be shifted to 
s = 31 -I- m. Thus, we can conclude that if the waiting 
time distribution can be accurately determined, it allows 
identification of the sequence ^ in a straightforward man- 
ner. 
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FIG. 4: Translocation time for hetero-DNAs with repeat units 
AmCn for different values of m and n as a function of the 
volume fraction of the base C. Here SpA ~ 3.0, Epc ~ 1-0, 
and the chain length A*' — 128. 
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FIG. 5: r as a function of the different composition for repeat 
units AmCn- Here, we define r as the ratio of translocation 
times for base C entering the pore first to base A entering the 
pore first. Here, e^a ~ 3.0, Spc = 1-0, and the chain length 
TV = 128. 



B. Effect of the different composition on the 
translocation 

In this section, we present results for the transloca- 
tion dynamics for hetero-DNAs with repeat units AmCn, 
where m and n are the length of the bases A and C in 
a single unit respectively. We define m + n as the block 
length of the polymer, and the volume fraction of the C 
component as /c = n/ (m + n). In the translocation dy- 
namics of these hetero-DNAs, it turns out that we need 
to distinguish the two different cases where the base A 
or the base C first enters the pore, respectively. 

We first examine short repeat units with m + n < 8 
for the whole range of volume fractions fc- The results 
are shown in Fig. 4. Because the attraction between 
the pore and the base A is stronger, the total transloca- 
tion time TA for poly (dA) 128 {fc — 0)is longer than the 
corresponding value tc for a poly (dC) 128 as found in our 
previous work (47| . The solid curve in Fig. 4 represents a 
simple average of the translocation time for poly (dA) 128 
and poly(dC)i28 given by t(/c) = ta{1 - fc) + Tcfc- 
It can be seen that the actual results for the translo- 
cation time of hetero-DNAs with short repeat units are 
always lower than this average, indicating a correlation 
effect between different monomer blocks. As expected, 
the translocation time decreases with increasing fc- For 
™ ^ 3 {fc > 0.6), the translocation time is almost inde- 
pendent on the orientation of which base enters the pore 
first. This is because the pore with length L = 5 can ac- 
commodate more than 3 monomers. Thus, for m < 3, the 
translocation times for the hetero-DNAs are determined 
largely by the configurations where all the A monomers 
are trapped inside the pore, independent of whether the 
base A or C enters the pore first. In this regime, the 
translocation time can be fitted empirically by the for- 
mula T{fc) = Tce^^~^'^'^ - 



For small values of fc < 0.4 (m > 5), the transloca- 
tion time T does depend on the orientation, with r being 
larger for the case where the base C enters the pore first. 
This asymmetry is due to the fact that the transloca- 
tion time is now controlled by the last block A exiting 
the pore. When this is not followed by a block C, the 
activation barrier for the exit of the last block is large, 
leading to a much longer total translocation time. In this 
regime, r shows faster exponential decay with increasing 
fc when the base A entering the pore first. To quantify 
the asymmetry, we define r as the ratio of translocation 
times for the case where base C enters the pore first to 
the case where A enters the pore first. Fig. 5 shows r 
for different sequences. It is clear that the asymmetry 
increase with the length of the base A sequence. 

Compared to the total translocation time, the waiting 
time for individual bases yields more information for the 
detailed sequence structure as shown in the last section. 
To show how the waiting time can be used to read a pe- 
riodic sequence, we show the result for poly(dA5dC3)i6 
in Fig. 6. There are several features worth noting. First, 
the waiting time of the last block A is much longer than 
the rest, due to the activated nature of the last block A. 
This effect is more pronounced for C entering the pore 
first, since it leads to a much larger activation energy for 
the exit of the last block. For the orientation in which 
A enters the pore first, both the monomers in the last 
block A and the last block C exhibit longer waiting times, 
but not as strongly as the other orientation as the corre- 
sponding activation energies are smaller. The orientation 
of DNA can thus be differentiated in this case. Second, 
the sequence lengths m = 5 for the base A and n — i for 
the base C are clearly distinguished in comparison with 
the waiting time distributions of their homopolymers. 

Moreover, we find that the waiting times for the or- 
dered DNA with repeat units AmCn exhibit "fringes" 
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FIG. 6: The waiting time distribution for poly(dA5dC3)i6. 
Here, e^a = 3.0, Epc = 1.0, and the chain length A'' = 128. 



reminiscent of optical interference pattern. This is due 
to the fact that near the end of exit of any block A, the 
waiting time increases to a maximum because of the in- 
crease in the activation energy. Thus the number of peaks 
is exactly equal to N/ (m + n), which is the periodicity of 
the sequences (cf. Fig. 6). 



C. Translocation time as a function of the chain 
length for different sequences 

Previously, we observed that the translocation time of 
homopolymers with pure repulsive interaction with the 
pore scales as r oc N'^'^ with the chain length for rela- 
tively short chains {N < 200) '4^l4i|. where v = 0.75 is 
the Flory exponent in 2D 66, |67[ . For heteropolymers 
consisting of two types of monomers that have a pure re- 
pulsive interaction with the pore and distinguished only 
by the magnitude of the driving force that they experi- 
ence inside the pore, we find that these scaling properties 
remain valid with arbitrary repeat unit [46l|. This could 
be easily understood by noting that at a higher level of 
coarse-graining, the microstructure of the chain is irrele- 
vant as far as universal scaling properties are concerned. 
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FIG. 7: Translocation time as a function of the chain length 
for different sequences. Here, SpA =3.0, and Spc = 1.0 



When attractive interaction is introduced between the 
monomers and the wall of the pore, we have found re- 
cently that while the scaling r oc N'^'^ is still valid for 
poly(dC) with the weaker attraction, the translocation 
time T shows non-monotonic behavior as a function of 
the chain length for poly(dA) due to the strong attractive 
force between poly (d A) and the pore [13]. This leads to 
an interesting question on the translocation time r scales 
with N for different sequences AmCn in heteropolymers 
considered in this work. In Fig. 7 we plot the results 
showing how the translocation time scales as a function 
of chain length N for different repeat structures A1C3, 
A2C2, A3C1, and AqC2- For A3C1 and AqC2, the translo- 
cation time r depends on the orientation, but the orien- 
tation does not affect the qualitative dependence of t on 
N. 

For A1C3, the scaling exponent is 1.21 which is close 
to 21^. However, with increasing block length of the base 
A, the effective scaling exponent decreases. For A2C2, 
it is 0.96 and it is less than 0.30 for A3C1, AqC2. The 
novel dependence on the length of DNA is due to the 
change from the non-activated regime for weak attrac- 
tive or pure repulsive interaction to an activated regime 
for strong attractive interaction. It would be desirable 
to have measurement made over a larger range of com- 
position to critically test our predictions for the effective 
scaling exponent. 



IV. CONCLUSIONS 

In this work, we have investigated the dynamics of 
DNA translocation through a nanopore driven by an 
external force using 2D Langevin dynamics simulations 
with an aim to understand how various aspects of the 
translocation dynamics is affected by a specific sequence. 
We have We have considered coarse-grained models of 
hetero-DNAs consisting of two types of bases A and C, 
which are differentiated by the different base-pore inter- 
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actions, such as a strong attractive force between the 
pore and the base A and a weaker attractive force be- 
tween the pore and the base C. From a scries of studies 
on polymers with sequences AmCn, we find the expo- 
nential dependence of the translocation time on fc , the 
volume fraction of the base C. For longer sequences with 
fc < 0.5 the translocation time strongly depends on the 
orientation of DNA, namely the condition which base en- 
ters the pore first. These results arc interpreted accord- 
ing to the waiting times of the individual bases. We find 
that the waiting time displays intriguing fringe patterns 
that can be understood from the process of emptying the 
more attractive subblocks from the pore. This also leads 
to a very simple qualitative understanding of the orien- 
tational dependence of the translocation time depending 
on which base enters the pore first. Furthermore, based 
on the waiting time distribution, the sequence lengths m, 
n and their periodicity can be distinguished. 

One may wonder to what extent our studies are rel- 
evant to bringing this observations into practice and to 



develop a "fast sequencing machine"? Among the most 
important conclusions here is that the waiting time dis- 
tribution of individual monomers can be used to iden- 
tify sequences in heteropolymers. It is worth noting in 
this context that the ionization potentials for different 
polynucleotides are different. Therefore, it is plausible 
that laser-induced attractive interactions along with fiu- 
orescence spectroscopy could be used to make a device 
that will detect cither the waiting time or the number of 
monomers inside the pore as a function of time. 
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